optimistic algorithm
Country:
- North America > United States > California > Santa Clara County > Palo Alto (0.04)
- North America > Canada (0.04)
Country:
- North America > United States > California > Alameda County > Berkeley (0.04)
- North America > Canada (0.04)
- Asia > Middle East > Jordan (0.04)
Country:
- North America > United States > Texas (0.04)
- North America > Canada > Alberta (0.04)
Technology:
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.83)
Technology:
Country:
- North America > United States > Massachusetts > Middlesex County > Belmont (0.04)
- North America > United States > California > Los Angeles County > Long Beach (0.04)
- North America > United States > California > Alameda County > Berkeley (0.04)
Tractable Instances of Bilinear Maximization: Implementing LinUCB on Ellipsoids
Zhang, Raymond, Hadiji, Hédi, Combes, Richard
We consider the maximization of $x^\top θ$ over $(x,θ) \in \mathcal{X} \times Θ$, with $\mathcal{X} \subset \mathbb{R}^d$ convex and $Θ\subset \mathbb{R}^d$ an ellipsoid. This problem is fundamental in linear bandits, as the learner must solve it at every time step using optimistic algorithms. We first show that for some sets $\mathcal{X}$ e.g. $\ell_p$ balls with $p>2$, no efficient algorithms exist unless $\mathcal{P} = \mathcal{NP}$. We then provide two novel algorithms solving this problem efficiently when $\mathcal{X}$ is a centered ellipsoid. Our findings provide the first known method to implement optimistic algorithms for linear bandits in high dimensions.
2511.07504
Country:
- North America > United States > New York > New York County > New York City (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > France (0.04)
Technology: